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Abstract 

We define a class of Z-interference channels for which we obtain a new upper bound 
on the capacity region. The bound exploits a technique first introduced by Korner and 
Marton. A channel in this class has the property that, for the transmitter-receiver pair 
that suffers from interference, the conditional output entropy at the receiver is invari- 
ant with respect to the transmitted codewords. We compare the new capacity region 
upper bound with the Han/Kobayashi achievable rate region for interference channels. 
This comparison shows that our bound is tight in some cases, thereby yielding spe- 
cific points on the capacity region as well as sum capacity for certain Z-interference 
channels. In particular, this result can be used as an alternate method to obtain sum 
capacity of Gaussian Z-interference channels. We then apply an additional restric- 
tion on our channel class: the transmitter-receiver pair that suffers from interference 
achieves its maximum output entropy with a single input distribution irrespective of 
the interference distribution. For these channels we show that our new capacity re- 
gion upper bound coincides with the Han/Kobayashi achievable rate region, which is 
therefore capacity-achieving. In particular, for these channels superposition encoding 
with partial decoding is shown to be optimal and a single-letter characterization for 
the capacity region is obtained. 



Index terms: Capacity, interference channel, one-sided interference channel, Z-interference 
channel 
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1 Introduction 



The interference channel, introduced in [1], is a simple network consisting of two pairs of 
transmitters and receivers. Each pair wishes to communicate at a certain rate with negligi- 
ble probability of error. However, the two communications interfere with each other. The 
problem of finding the capacity region of the interference channel is difficult and therefore 
remains essentially open except in some special cases, e.g., interference channels with statis- 
tically equivalent outputs [2-4], discrete additive degraded interference channels [5], a class 
of deterministic interference channels [6], strong interference channels [7-11], and a class of 
degraded interference channels [12]. 

While the capacity region of the interference channel remains largely unknown, achievable 
rate regions and capacity region upper bounds have been found. To date, techniques for 
obtaining achievable rate regions for the interference channel include treating interference as 
noise [2], fully decoding the unwanted interference [3], superposition encoding and partially 
decoding the unwanted interference [4,9], and time division multiplexing/frequency division 
multiplexing [3,4]. Techniques for upper bounding the capacity region of the interference 
channel include allowing transmitters and receivers to cooperate without requiring additional 
bandwidth or power [3], introducing an imaginary but more capable receiver into the system 
[13], and genie-aided receivers [6,14-16]. 

The Z-interference channel is an interference channel where one transmitter-receiver pair 
is interference- free. Though finding the capacity region of the Z-interference channel is a 
simpler problem than that of the interference channel, capacity results are still limited, 
with the following exceptions: The Z-interference channel capacity region is known when 
the interference is deterministic [6, Section IV]. The Z-interference channel sum capacity 
is known when the Z-interference channel is Gaussian [17], or when the interference-free 
link is also noise-free [18], although in both cases, the full capacity region has not been 
characterized. 

In this paper, we first provide a new capacity region upper bound for a class of Z- 
interference channels that satisfy a given condition: namely that the performance of the 
transmitter-receiver pair that suffers from interference depends only on the distance of the 
transmitter's codewords and not their exact locations. The technique [19, page 314] that we 
use in obtaining the converse associated with this upper bound was introduced by Korner 
and Marton in [20]. This technique has been useful in the solution of several problems in 
multi-user information theory, in particular for broadcast channels with degraded message 
sets [21], communication where the transmitter has non-causal perfect side information, i.e., 
the Gelfand-Pinsker problem [22], and semicodes for the multiple access channel [18]. We 
then compare the new capacity region upper bound with the Han/Kobayashi achievable 
rate region [9]. This comparison yields certain points on the capacity region, including the 
sum-rate point, for channels in our defined class. 
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We next add an additional condition to our channel class, namely that for the transmitter- 
receiver pair that suffers from interference, the maximum output entropy can always be 
achieved regardless of the choice of the interferer's codebook. With this additional condition 
we show that our capacity region upper bound coincides with the Han/Kobayashi achievable 
rate region, and hence equals the capacity region for this subclass of Z-interference chan- 
nels. The Han/Kobayashi achievable rate region is obtained using the idea of superposition 
encoding and partial decoding. Specifically, the transmitters are required to encode their 
messages via superposition encoding, and each receiver is required to decode not only its 
own message, but also part of the interference. One of the main difficulties in finding the 
capacity region of the interference channel is to justify the need for partial decoding in the 
converse proofs, as the partially decoded information is not required at the receiver. Using 
our new capacity region upper bound, we are able justify the optimality of superposition 
encoding and partial decoding for a subclass of Z-interference channels. So far, the only 
result that proves superposition encoding and partial decoding is optimal is [6] where, due 
to the deterministic nature of the channel, the capacity region upper bound obtained by 
using the technique of genie-aided receivers is sufficient to meet the achievable rate region 
of [9]. Using a different upper bounding technique, we show that superposition encoding 
and partial decoding is optimal for certain Z-interference channels, which are not necessarily 
deterministic and, therefore, provide a single-letter characterization for the capacity region 
of these channels, which was previously unknown. 

The remainder of this paper is organized as follows. In Section El we define the class 
of Z-interference channels investigated in this paper by providing two conditions on the 
channels. In Section [3J we provide a new capacity region upper bound for Z-interference 
channels satisfying the first conditon. We restate the Han/Kobayashi achievable rate region 
for Z-interference channels in Section HI In Section [5j points on the capacity region as well as 
sum capacity for certain Z-interference channels are found by comparing our capacity region 
upper bound with the Han/Kobayashi achievable rate region. In Section [6j we provide the 
single-letter characterization of the capacity region for Z-interference channels that satisfy 
both conditions. This is followed by our conclusions in Section Details of certain proofs 
are given in the Appendix in Section [HJ 



Consider a Z-interference channel with two transition probabilities p(y i\xi) and p^l^ii #2)- 
The input and output alphabets are Xi, X 2 , y± and 3^2- Set 



2 System Model 






(2) 
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Let Wi and W 2 be two independent messages uniformly distributed on {1, 2, • • • , Mi} and 
{1, 2, • • • , M 2 }, respectively. Transmitter i wishes to send message Wi to Receiver i, i = 1, 2. 
An (Mi, M 2 , n, e n ) code for this channel consists of a sequence of two encoding functions 

/ f n :{l,2,...,2"*}->A?, * = 1,2 (3) 

and two decoding functions 

tf:3?-{l,2,... ,2**}, i = l,2 (4) 

with probability of error 

en = max J A^-J2 F ^9?(Yn^w i \W 1 = w u W 2 = w 2 }. (5) 

A rate pair (Ri,R 2 ) is said to be achievable if there exists a sequence of (2 nRl , 2 nR2 , n, e n ) 
codes such that e n — > as n — > 00. The capacity region of the Z-interference channel is the 
closure of the set of all achievable rate pairs. 

An example of the Z-interference channel is the Gaussian Z-interference channel, where 
X\ = X 2 = y% = y 2 = R, and p(yi\xi) and p(y 2 \xi, x 2 ) are given as 

Y 1 = X 1 + Z 1 , (6) 
Y 2 = aX x + X 2 + Z 2 , (7) 

where Z\ and Z 2 are independent Gaussian random variables with zero mean and unit 
variance, oGl, and the channel inputs have to satisfy the average power constraints of Pi 
and P 2 . 

The class we investigate in this paper consists of Z-interference channels that satisfy : 

Condition 1: For any n = 1, 2, • • • , H{Y 2 n \^ 2 = ^2); when evaluated with the distribution 
P( x i)p(y 2 \ x i 5 x 2 )i i s independent of x 2 for any 

Condition 1 specifies that the channel p(y 2 \xi, x 2 ) is invariant, in terms of conditional 
output entropy, with respect to the input sequence of Transmitter 2, i.e., x 2 . This means 
that when designing the codebook of Transmitter 2, the exact locations of the codewords 
do not affect the performance, rather, it is the relative locations, or "distances", between 
codewords that matter. For example, the Gaussian Z-interference channel, defined in ([6]) 
and (jTj), satisfies this condition. 
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Define r as 



t = max H(Y 2 ). 

p(x 1 )p(x 2 ) 
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The class of Z-interference channels for which we are able to obtain the capacity region 
satisfy Condition 1 as well as the following condition. 

Condition 2: There exists a p*{x2) such that if (Y2), when evaluated with the distribution 

J2 Xl ,x 2 p( x i)p*( x 2)p(y2\ x i> x 2)> is ec i ual to r for an y p( x i)- 

Intuitively, Condition 2 specifies that no matter how tightly packed the codewords in 
codebook 1 are, by spacing out the codewords in codebook 2, we can always fill up the 
entire, or maximum, output space at Receiver 2. This means that using an i.i.d. generated 
codebook with p*(x2) at Transmitter 2 is to our advantage, as the larger the output space, 
the more codewords of Transmitter 2 we can pack in the space. Note that the Gaussian 
Z-interference channel does not satisfy this condition, since the largest output space is only 
achieved when both p(xi) and p{x2) are Gaussian with variances Pi and P2, respectively. 
The largest output space cannot be achieved with a p*(x2) that is irrespective of p(x\), as 
specified in Condition 2. 

Now, we give an example of a channel, shown in Figure HJ where both conditions are 
satisfied. Let X 2 = 3^2 = S = {0, 1, 2, • • • , q — 1}, where q is an arbitrary interger. Let 
sets Xi, y%, and probability distributions p(yi\xi) and p(s\xi) be arbitrary. The channel is 
defined as Vi(yi\xi) = p(yi\xi), and V^^l^i, £2) is defined as ^2 seS p(y2\s,X2)p(s\xi), where 
P(y2\s,x 2 ) is given by 



In ([9]), © is the mod-g sum. It is easy to see that the channel thus defined satisfies Condition 
1. By letting p*(x 2 ) be the uniform distribution on {0, 1,2, • • ■ , q — 1}, we find that the 
channel also satisfies Condition 2, where r = log 1 3^2 1 = logg. 

The above example is related to several Z-interference channels that have been studied 
in [5,6,16], as we now describe in more detail. 

1. The discrete additive degraded interference channels studied in [5], using similar tech- 
niques as [23, Fig. 6], can be shown to be equivalent to, or in other words, have the 
same capacity region as, the following Z-inteference channel: 



Y 2 = S®X 2 . 



(9) 



Y 1 = X 1 ® Z 



(10) 

(11) 
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P Yi|X, 






Figure 1: Example of a Z-interference channel satisfying Conditions 1 and 2 



The Z-interference channel characterized by ( flOl) and ( fTTj) is a special case of our 
example. The derivation of the capacity region in [5] relies on the degradedness of 
output Y 2 with respect to Yi, which makes treating interference as noise optimal. In our 
example, we do not make an assumption on degradedness, and show that superposition 
encoding and partial decoding is optimal. 

2. The example shown in [6, Fig. 3] is a special case of our example, with X 2 = y 2 — 
{0, 1} = S = and X x = {0, 1, 2}, p{y\\xi) is given as 



p(yik0 = 

and p(s\xi) is given as 
p{s\xi) = 



1 

e 
1 




if (xi,yi) 
if (xiiVi) 
if Oi,2/i) 
otherwise 



(0,0) or (xuyt) 

(2,0) 

(2,1) 



(1,1) 



(12) 



if (xi, s) = (0, 0) or (xi, s) = (1, 1) or (x±, s) 
otherwise 



(2,0) 



(13) 



which is deterministic. As mentioned before, the capacity region of this channel is 
found in [6] by matching the achievable rate region of superposition encoding and 
partial decoding and the capacity region upper bound of genie-aided receivers. The 
deterministic nature of p^l^i, x 2 ) plays an important role in obtaining the results. In 
our example, we make no assumption on the channel being deterministic, and use a 
different upper bounding technique to match the achievable rate region of superposition 
encoding and partial decoding. 

3. For a class of interference channels, [16] quantifies the gap between the achievable rate 
region of superposition encoding and partial decoding and the capacity region upper 
bound obtained using the technique of genie-aided receivers. Consider a special case 



6 



of the class of interference channels considered in [16], where p(s 2 \x 2 ) in [16, Fig. 1] is 
independent of x 2 and the deterministic function f 2 in [16, Fig. 1] is the modulo sum 
operation. This special case is contained in our example. Hence, for this special case, 
the results in this paper provide the exact capacity region, and we may conclude that 
the achievable rate region of Han and Kobayashi used in [16] is in fact optimal, while 
the capacity region upper bound in [16] is not tight. 

As a final remark in this section, note that in general, Condition 1 can not be verified 
in a "single-letter" way, i.e., by checking for only the case of n — 1. However, for many 
channels, such as the Gaussian Z-interference channel in fl6]) and ([7j) and the example given 
in Figured], Condition 1 is straightforward to verify. 

3 New Capacity Region Upper Bounds 

In this section, we obtain a new capacity region upper bound for the class of Z-interference 
channels that satisfy Condition 1. 

Before we proceed, we will first restate the converse technique introduced in Korner 
and Marton [20] . The technique is a method of writing the difference between two n- letter 
entropies into a sum of differences between conditional entropies, which may then be written 
as the difference between two single letter conditional entropies by defining the appropriate 
auxiliary random variables. 

Lemma 1 [19, page 314, eqn (3.34)] 

For any n, and any random variables Y n and Z n , we have 



H(Z n ) - H(Y n ) = {H{Z t \Y^\ Z i+1 , Z l+2 , ■ ■ ■ , Z n ) - H^Y*- 1 , Z i+1 , Z l+2 , ■ ■ ■ , Z n )) . 



Now, we state our upper bound for the class of Z-interference channels that satisfy Con- 
dition 1. 

Theorem 1 For a Z-interference channel, characterized by transition probabilities V\ and 
V%, that satisfies Condition 1, if rate pair (Ri,R 2 ) is achievable, then it must satisfy 



1 1 



(14) 



Ri<H{Y 1 \U,Q)+j-H(Y 1 \X 1 ) 
R 2 <H(Y 2 \Q)-H(T\U,Q)- 1 
0< 1 <mm(I(Y 1 ;U\Q),I(T;U\Q)) 



(15) 
(16) 
(17) 
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for some distribution p(q)p(xi,u\q)p(x2\q) and number 7, where the mutual informations and 
entropies are evaluated using p(q, u, x%, x 2 ,y x , t, y 2 ) = p(q)p(xi, u\q)p(x 2 \q)Vi(yi\xi)V 2 (t\xi, x 2 ) 
V 2 (y 2 \xi, x 2 ) , and x 2 is an arbitrary element in X 2 . 

Theorem [I] is the major technical contribution of this work. However, to improve the read- 
ability of the paper, we have moved the proof of Theorem [T] to the Appendix in Section 18.11 
The key point of the proof is to define an imaginary memory less channel V 2 '■ X\ — > 3^ 2) with 
input X\ and output T, as 

%{t\x 1 ) = V 2 {t\x 1 ,x 2 ), (18) 

where x 2 is an arbitrary element in X 2 . Theorem [1] is proved by using the definition of this 
imaginary channel, Lemma [1] and Condition 1. Notice that T is a random variable that we 
created in the proof of Theorem [T] which does not exist in the actual communication system. 

4 Achievable rates for Z-interference channels 

In this section, we specialize the achievable rate region in [9], which is developed for the 
interference channel, to the Z-interference channel. More specifically, we use, as in [24], 
Fourier-Motzkin elimination and the fact that p(ui\q)p(wi\q)f(xi\ui, u>i, q) is sufficient to 
achieve all possible marginals p(wi\q)p(xi\vji, q) for % = 1, 2. By setting the auxiliary random 
variable associated with Receiver 2 to be constant, we obtain the following achievable rate 
region. 

Define V as the set of distributions p(q, ) that satisfies 

p(q, it, x u x 2 ) = p(q)p(x 1 , u\q)p{x 2 \q). (19) 

For each p e V, further define the region Qi(p) as 

Qxip) = {(R u # 2 )|#! < I{Xu Y X \U, Q) + min(/(y i; U\Q),I(Y 2 - U\X 2 , Q)), 
R 2 <I(X 2 ;Y 2 \U,Q), 

Ri + R 2 < I{X X] Y X \U, Q) + I(U, X 2 ; Y 2 \Q)}. (20) 

where the mutual informations are evaluated with p, and the given channel transition prob- 
abilities p(yi\xi) and p(y 2 \x\, x 2 ). 

Theorem 2 [9, 24] For the Z-interference channel described by V\ and V 2 , an inner bound 
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on the capacity region is 



Qi=\jQi(p). (21) 

pev 

Furthermore, the inner bound remains invariant if we impose the following constraint on the 
cardinality of the auxiliary random variables: 

|W|<|Ai| + 2, |Q|<4. (22) 



5 Comparison of achievable rates and rate upper bounds 

Since T is a random variable that we created in the proof of Theorem [1] which does not exist 
in Theorem [21 in order to compare the new capacity region upper bound with the known 
achievable rate region, we rewrite the upper bound in a different way by replacing T with 
Y 2 conditioned on X 2 , using Condition 1. In other words, the capacity region upper bound 
given in Theorem [1] is equivalent to the one given in the next theorem. 

Theorem 3 For a Z-interference channel, characterized by transition probabilities V\ and 
V 2 , that satisfies Condition 1, if the rate pair (Ri,R 2 ) is achievable, then it must satisfy 

R 1 <I(X 1 ;Y 1 \U,Q)+ 1 (23) 
R2<I(U,X 2 ;Y 2 \Q)- 1 (24) 
< 7 < min(/(r i; U\Q), I(Y 2 - U\X 2 , Q)) (25) 

for some distribution p(q)p(xi, u\q)p(x 2 \q) and number 7, where the mutual informations are 
evaluated using p(q, u, x u x 2 , y 1: y 2 ) = p(q)p(x 1 ,u\q)p(x 2 \q)V 1 (y 1 \x 1 )V 2 (y 2 \x 1 , x 2 ). 

A proof of Theorem [3] is provided in the Appendix in Section 18.21 Based on Theorem [31 
we now restate the capacity region upper bound as follows, by eliminating variable 7. The 
proof is straightforward, and thus omitted. For each peP, define the region Q {p) as 

g o (p) = {(R h RJlRi < J(X i; Y X \U, Q) + min(/(n; U\Q),I(Y 2 ; U\X 2 , Q)), 
R 2 <I(U,X 2 ;Y 2 \Q), 

Ri + R 2 < I(X 1] Y,\U, Q) + I(U, X 2 ; Y 2 \Q)}. (26) 

where the mutual informations are evaluated with p, and the given channel transition prob- 
abilities p(yi\xi) and p(y 2 \xi, x 2 ). 
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Theorem 4 For a Z-interference channel, characterized by transition probabilities V\ and 
V 2 , that satisfies Condition 1, an outer bound on the capacity region is 

Go={jGo(P)- (27) 

per 

Furthermore, the outer bound remains invariant if we impose the following constraint on the 
cardinality of auxiliary random random variables: 

|W|<|Ai| + l, |Q|<4. (28) 

Comparing the capacity region upper bound in Theorem H] and the achievable rate region 
in Theorem [21 we see that the set of allowable distributions, V, are the same, while the 
regions, Gi{p) and Q {p), look slightly different for the same p G V. More specifically, among 
the three equations that characterize Gi(p) and Q {p), two of them are exactly the same. 
Notice also the fact that, the region Q (p) is always a pentagon, while the region Qi{p) can 
be a pentagon if p satisfies I(U ;Yi\Q) > I{U ;Y 2 \Q), and a rectangle otherwise. We illustrate 
the gap between Qi{p) and Q (p) in the following two figures. If the distribution p is such 
that I(U;Y\\Q) < I(U;Y 2 \Q), the comparison between Qi(p) and Q (p) looks like Figure EJ 
On the other hand, if the distribution p is such that I(U ; Yi\Q) > I(U ; 3^|Q), the comparison 
would look like Figure [3j In this case, corner point A is achievable. 

The outer bound, Q a , is a union of pentagons, and the boundary of Q Q consists of the two 
corner points of certain pentagons. It is likely that point A of Q (po) for some po e V, will 
appear on the boundary of Q a . If po also satisfies I(U ;Yi\Q) > I{U \Yz\Q), then point A is a 
capacity point, i.e., a point on the boundary of the capacity region. We formalize this fact 
in Corollary [TJ 

Corollary 1 For Z-interference channels that satisfy Condition 1, if point A given by: 

(I(X 1 ; Yx\U, Q) + min(/(n; U\Q),I(Y 2 ; U\X 2 , Q)), 

I(U, X 2 ; Y 2 \Q) - min(/(F i; U\Q),I(Y 2 - U\X 2 , Q))) (29) 

for some distribution p G V that satisfies I(U;Yi\Q) > I(U;Y 2 \Q) happens to be on the 
boundary ofQ a , then point A is a point on the boundary of the capacity region. 

Further, by comparing the achievable rate region and capacity region upper bound, we may 
make the following statement about the sum capacity of Z-interference channels that satisfy 
Condition 1. 
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Figure 2: Comparison of Gi{p) and Q (p) when I(U;Yi\Q) < /(£/"; Y2IQ) 
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Figure 3: Comparison of Gi(p) an d Go(p) when !(£/"; YjjQ) > /(?/; I2IQ) 
Corollary 2 For Z-interference channels that satisfy Condition 1, if 

max I(X 1 ;Y 1 \U,Q) + I(U,X 2 ;Y 2 \Q) (30) 

p(q)p(x 1 ,u\q)p(x 2 \q) 

is achieved by a distribution that satisfies I(U; Yi\Q) > i/ien t/ie sum capacity is 

W- 

To demonstrate how Corollary [2] might be used, consider the Gaussian Z-interference channel, 
defined in (JBj) and ((7|), where a < 1. For any p(q)p(xi, u\q)p(x2\q) , ; Yi |Q) > J(?7 ; is 
satisfied. Hence, ( 1301) is the sum capacity. The sum capacity of the Gaussian Z-interference 
channel where a < 1 has been found in [17] using a different technique. 
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6 Capacity region of a subclass of Z-interference chan- 
nels 



As illustrated in Section [51 in general, for Z-interference channels that satisfy Condition 1, 
the capacity region upper bound, described in Theorem HJ and the achievable rate region, 
described in Theorem El are not the same. However, in this section we show that when the 
Z-interference channel satisfies the additional Condition 2, our capacity region upper bound 
and the Han/Kobayashi achievable rate region coincide, yielding the capacity region. 

In the achievable rate region derived in Theorem El if we specify Q to be a constant, 
i.e., time sharing is not employed, and if we further specify p(x 2 ) to be p*(x 2 ), defined in 
Condition 2, we obtain another achievable rate region: 

R x < I(Xi; Yt\U) + min(/(tf; Y 1 ),I(U; Y 2 \X 2 )) (31) 
R 2 < t - H(Y 2 \X 2 , U) (32) 
R 1 + R 2 < Y 1 \U)+t- H(Y 2 \X 2 , U) (33) 

for some distribution p(u)p(xi\u), where the mutual informations and entropies are evaluated 
with 

p(u,x 1 ,x 2 ,y 1 ,y 2 ) = p(u)p(x 1 \u)p*(x 2 )Vi(y 1 \x 1 )V 2 (y 2 \x 1 ,x 2 ). (34) 

In obtaining this region, we have used the fact that the Z-interference channel satisfies 
Condition 2, and therefore, we have 

H(Y 2 \U) = ^2p(u)H(X 2 \U = u) = t = H(Y 2 ). (35) 

u 

This achievable rate region is potentially smaller than that described in Theorem El due to 
the fact that we have eliminated time sharing and fixed p(x 2 ) to be a specific distribution. 

As for the upper bound, rather than starting from the capacity region upper bound 
described in Theorem HJ we study the equivalent region described in Theorem [TJ By the def- 
inition of r in (|Sj), another upper bound on the Z-interference channel that satisfies Condition 
1, which is looser than that of Theorem (H is 

R 1 <H(Y 1 \U,Q)+j-H(Y 1 \X 1 ) (36) 
R 2 <r-H{T\U,Q)- 1 (37) 
0< 1 <mm{I(Y 1 ;U\Q),I(T;U\Q)) (38) 

for some distribution p(q,u)p(xi\u,q) and number 7, where the mutual informations and 
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entropies are evaluated using p(q,u,xi,yi,t) = p(q, u)p{x\\u, q)Vi(yi\xi)V 2 (t\xi, x 2 ). Note 
that by replacing H(Y 2 \Q), or more specifically H(Y 2 \Q = q), with a possibly larger term r, 
we have removed the dependence of the region on p(x 2 \q). 

By defining a new auxiliary random variable U to be (U, Q), and noting the fact that 

fffTilQ) < H(Xx) (39) 
H(T\Q) < H(T), (40) 

we obtain yet another capacity region upper bound, which is possibly looser than the previous 
one described in (l36i)-(|38l): 

Rx<H(Y 1 \U)+ 1 -H(Y 1 \X 1 ) (41) 
R 2 <r-H{T\V)- 1 (42) 
< 7 < min(/(y 1 ; U), I(T\ U)) (43) 

for some distribution p(u)p(xi\u) and number 7, where the mutual informations and entropies 
are evaluated using p(u,Xi,yi,t) = p(u)p(xi\u)Vi(yi\xi)V 2 (t\xi,x 2 ). By defining U to be 
(U, Q), we have gotten rid of the time sharing auxiliary random variable Q. 

For notational convenience, we replace U with U, and eliminate variable 7, to get the 
following capacity region upper bound for Z-interference channels satisfying Condition 1, 
which is equivalent to the capacity region upper bound described in (l4TT)- (l43l) : 

R x < I{Y X] X 1 \U) + min(J(y i; U) , I(T\ U)) (44) 
R 2 <t-H{T\U) (45) 
Ri + R 2 <I(Y 1 ;X 1 \U) + t-H(T\U) (46) 

for some distribution p(u)p(xi\u), where the mutual informations and entropies are evaluated 
using 

p(u, xi, 7/1, t) = p(u)p(xi\u)V 1 {y 1 \xi)V 2 (t\x 1 ,x 2 ). (47) 

Next, we will show that when the Z-interference channel satisfies Conditions 1 and 2, the 
capacity region upper bound described by fj4*4l) - (|46j) and the achievable rate region described 
by fl3T|) - (!33|) are the same when evaluated with the same p(u)p(x\\u). 

When evaluated with the same p(u)p(x\\u) , it is obvious that the terms H(Y\\U), H{Yi\Xi) 
and I(U;Yx) are equal in both (Pj)-<Pj) and (j5I|)-(j55|>. H(T\U), respectively H(T), evalu- 
ated with the distribution in (J17|l is equal to H(Y 2 \X 2 = x 2 , U), respectively H(Y 2 \X 2 = x 2 ), 



13 



evaluated with the distribution in (|34|) . Furthermore, evaluated with the distribution in (134|) . 



H(Y 2 \X 2 , U) = ^p*(x 2 )p{u)H(Y 2 \X 2 = x 2 ,U = u) 



(48) 



= ^2p*(x 2 )p(u)H(Y 2 \X 2 = x 2 ,U = u) 



(49) 



= H(Y 2 \X 2 = x 2 ,U), 



(50) 



We also have 



H(Y 2 \X 2 ) = ^P*(^)H(Y 2 \X 2 = x 2 ) = H(Y 2 \X 2 = x 2 ), 



(51) 



where we obtain (|4"9"|) and f[5"Tl) using Condition 1 with n = 1. Thus, we have proved that the 
region described by (f^4|) - (H6l) and that described by (I3~i~l) - (l33l are the same when evaluated 
with the same p{u)p(xi\u) . 

Since the capacity region upper bound and the achievable rate region are taking the 
union of regions described by ( H4T) - (H6l) and (l3TT) - (l33l) . respectively, over all p(u)p(xi\u) , the 
achievable rate region and the capacity region upper bound coincide, yielding the capacity 
region. 

Hence, the capacity region of the class of Z-interference channels that satisfies Conditions 
1 and 2 is 



for some distribution p(u)p(xi\u), where the mutual informations and entropies are evaluated 
with p(u,xi,x 2 , 2/2) = p(u)p(xi\u)p*(x 2 )V l (y 1 \x 1 )V 2 (y 2 \x 1 , x 2 ). Using support lemma [19, 
Lemma 3.4], without loss of generality, we may bound the cardinality of the auxiliary random 
variable U as \U\ < + 1. 

In Figure HI we plot the capacity region and prior upper bounds by Telatar and Tse [16] 
and by Sato [3] for the Z-interference channel based on the model illustrated in Figure [lj 
where p{y\ \x\) is a binary erasure channel (BEC) with erasure probability 0.4 and p{s\x\) is a 
binary symmetric channel (BSC) with crossover probability 0.1. This Z-interference channel 
is based on the model illustrated in Figured! and therefore satisfies both Conditions 1 and 2. 
Without loss of generality, by restricting the cardinality of the auxiliary random variable to 
be 3, we plot the capacity region as given by fl52|) - fl54|) . The dashed region is an inner bound 
of the upper bound given by [16, Theorem 1] where we have omitted time sharing. The 
dot-dash region is an upper bound given by [3, Theorem 2] which outperforms the MAC, 



R x < Y 1 \U) + min(/(C7; Y 1 ), I{U; Y 2 \X 2 )) 

R 2 < T - H(Y 2 \X 2 ,U) 
Ri + R 2 < I{Xu Y X \U) + r - H(Y 2 \X 2 , U) 



(52) 
(53) 
(54) 
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Figure 4: Capacity region and known upper bounds for the Z-interference channel of Figure 
1, where p(yi\xi) describes a BEC with erasure probability .4 and p(s\xi) describes a BSC 
with crossover probability .1 

BC outer bounds in [3]. But for this channel model, the upper bound given by [3, Theorem 
2] is simply equivalent to 

|J {CRi,i22)|i2i</(Jri;ri), R 2 <I(X 2 ;Y 2 \X l )}, (55) 

p(xi),p(x2) 

which is the performance of both users where there is no interference. As can be seen, our new 
capacity region upper bound significantly tightens known upper bounds. More significantly, 
from the proofs in this section, we know that it equals the capacity region. 

7 Conclusions 

We have obtained a single-letter characterization for the capacity region of Z-interference 
channels that satisfy certain conditions. The two conditions are 1) that the performance of 
the second transmitter-receiver pair is invariant to the codeword locations of Transmitter 
2 (i.e. it depends only on codeword distances), and 2) that regardless of the codebook 
choice of Transmitter 1, the input to Receiver 2 can always achieve maximum entropy. The 
results are obtained through a new upper bound on the capacity region that exploits a 
technique by Korner and Marton. This new upper bound is shown to coincide with the 
Han/Kobayashi achievable rate region for channels satisfying the stated conditions. Thus, 
the Han/Kobayashi transmission strategy of superposition encoding and partial decoding is 
capacity-achieving for this class of channels. 

The upper bound also applies to channels satisfying the first but not the second condition. 
This bound significantly improves on existing upper bounds by Teletar/Tse and by Sato, as 





capacity region 




Telatar & Tse Outer 




Sato Outer 
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is illustrated with a numerical example. We also show that this upper bound coincides with 
the achievable rates at certain points on the capacity region, including the sum-rate point, 
for Z-interference channels satisfying the first condition. This upper bounding technique 
may also improve known bounds for other types of interference channels. 

8 Appendix 

8.1 Proof of Theorem [I] 

Since the rate pair (Ri,R 2 ) is achievable, there exist two sequences of codebooks 1 and 2, 
denoted by C" and C 2 , of rate Ri and R 2 , and probability of error less than e n , where e n — > 
as n —>■ oo. Let X™ and X^ be uniformly distributed on codebooks 1 and 2, respectively. 
Let F" be connected via V™ to X™, Y 2 be connected via V 2 to X™ and X 2 . 

We start the converse with Fano's inequality [25] and the data processing inequality [25], 




(56) 
(57) 







(59) 



n 




(60) 



i=l 



it 




(61) 



i=l 



and (16T|) follows from the memoryless nature of V™. We also have 




(62) 
(63) 
(64) 
(65) 






n 




(66) 



n 




(67) 



where ( 1671) follows because conditioning reduces entropy. 
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Let us define another channel, V 2 : X\ — > y 2 , as 

V 2 (t\x 1 ) = V 2 {t\x u x 2 ), (68) 

where x 2 is an arbitrary element in X 2 . Further, let us define another sequence of random 
variables, T n , which is connected via V 2 n , the memoryless channel V 2 used n times, to X™, 
i.e., Ti — > Xu — > T{j}c, Xi{j} C , , F", Y 2 . Also define 22 as the 72,-sequence with x 2 repeated 
n times. It is easy to see that 

H{Y?\X%) = £ l^n 2 H{Y-\X- = x») (69) 

= H(Y?\X% = x n 2 ) (70) 
= H(T n ), (71) 

where (1701) follows from the fact that the channel under consideration satisfies Condition 1, 
and (J7T|) follows from the definition of T n . 
By applying Lemma [IJ we have 

n 

H(T n ) - H(Y{ 1 ) = ]T {H{T t \Yr\ T i+1 ,T i+2 , ■ ■ ■ ,T n ) - H(Y H \Y^ 1 , T l+1 ,T l+2 , ■■■ , T n )) . 

(72) 

Furthermore, since conditioning reduces entropy, we can write 



Ti 



H{Y?) =Y,H{Yii\Yt 1 ) < ( 73 ) 

i=l i=l 
n n 

#(T n ) = E H^T- 1 ) < H{Ti) (74) 

8=1 8=1 
n n 

H(Y?)=Y,H(Y li \Y*- 1 ) > J2H(Y u \Yt\T i+1 ,T i+2 ,--- ,T n ). (75) 



i=l »=l 
Define the following auxiliary random variables, 



Ui = Y{-\T i+1 ,T i+2 ,--- ,T n , i = l,2,-..,n (76) 
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Hence, we have 

1 1 - 

- (H(T n ) - H{Y?)) = - V (H(Ti\Ui) - H(Y lt \U t )) (77) 
n n 

1 - 1 1 n 

- TH^M) < -H{Y?) <-rff(y„) (78) 

n n n z — ' 

i=i t=i 

n 

—H(T n ) < - S~] H(Ti) . (79) 

Further define Q as a random variable that is uniform on the set {1, 2, • ■ ■ , n} and indepen- 
dent of everything else. Also, define the following auxiliary random variables: 

U = U Q , X 1 = X 1Q , X 2 = X 2Q , Y 1 = Y 1Q , T = Tq, Y 2 = Y 2Q . (80) 

Then, we have 

- (H(T n ) - H(Y{ 1 )) = H(T\U,Q) - H(Y 1 \U, Q) (81) 
n 

H(Y 1 \U, Q) < -H(Y?) < H(Y 1 \Q) (82) 
n 

—H(T n ) < H(T\Q). (83) 
n 

Next, we prove some properties of the joint distribution of Q, U, X 1 , X 2 , Yi, T and Y 2 . 
First, we have 

Pr[X 2 = x 2 \Xi = xi,U = u,Q = i] (84) 

= Pr[X 2i = x 2 \X u = x l ,U i = u,Q = i] (85) 

= Vi[X 2l = x 2 \X u = x 1 ,{Yt\T i+u T i+2 ,--- ,T n )=u,Q = i] (86) 

= Pr[X 2 , = x 2 \X u = x u (Yt\T i+1 ,T i+2 , ■ ■ ■ ,T n ) = u] (87) 

= Pi[X 2l = x 2 ] (88) 

= Pi[X 2 = x 2 \Q = t], (89) 

where flHTj) and fl89l) are because Q is independent of everything else, and therefore can 
be dropped from the conditioning, and flHHl) follows because X 2i is a function of message 
W 2 , while (Xu, F^ -1 , T i+1 , T i+2 , • - • , T n ) are functions of message W\. Since Wi and W 2 are 
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independent, so is X 2i and (Xu, Y[ , Ti + \,T i+2 , • • • , T n ). We also have 

Pr[Y 1 = y 1 \X 1 = x 1 ,X 2 = x 2 ,U = u,Q = i] (90) 

= Pt[Yu = y^Xu = x x ,X 2i = x 2 ,Q = i, Ui = u] (91) 

= Pt[Y u = yi \X u = xi, X 2i = x 2 , Q = i, {Yt\ T i+1 ,T t+2 , ■ ■ ■ , T n ) = u] (92) 

= PrfKii = yi\X u = x 1} X 2i = x 2 , (Y^" 1 , T i+1 ,T i+2 , ■ ■ ■ ,T n ) = u] (93) 

= Pi[Y li = y 1 \X ll = x l } (94) 

= V 1 (y 1 \x 1 ), (95) 

where ( !93|) follows by the same reason as ( 1871) . and (l94l follows from the memoryless nature 
of V7 1 . We further have 

(96) 
(97) 

Vi] (98) 
(99) 
(100) 
(101) 

where ( 199]) follows by the same reason as (IHTj) . and fllOOp follows from the definition of T n , 
more specifically, the memoryless nature of V 2 n and the fact that Tj — > — > T{j}c, X 2i , K", 
for z = 1, 2, • • • , n. Finally, we have 

Pt[Y 2 = y 2 \Xx = Xl ,X 2 = x 2 ,U = u,Q = t,T = t,Y l= Vl ] (102) 

= Pr[Y 2i = y 2 \Xii = x 1 ,X 2i = x 2 ,U { =u,Q = i,T { = t,Y u = y x ] (103) 

= Pi\Y 2i = y 2 \X u = x 1 ,X 2i = x 2 , (Y^ 1 ,T i+ i,T i+2 , ■ ■ ■ ,T n ) = u,Q = z, T f = t, Y u = y x ] 

(104) 

= Pi\Y 2i = y 2 \X Xi = Xi,X 2i = x 2 , (YI~ l ,T i+1 ,T i+2l ■ ■ ■ ,T n ) = u,Ti = t,Y u = y x ] (105) 
= Pr[Y 2i = y 2 \X li = x 1 ,X 2i = x 2 ] (106) 
= V 2 (y 2 \x 1 ,x 2 ), (107) 

where (11051) follows by the same reason as (IHTj) , and fl 1 6 [) follows from the memoryless nature 
of V 2 . Thus, based on ( 1891 . ( 1951) . f llOip and (11071) . we have shown that the joint distribution 
of Q, U, X±, X 2 , Yi, T and Y 2 satisfies 

p(q, u, x x , x 2 ,y x , t, y 2 ) = p(q)p(u, Xi|g)p(x 2 |g)Vi(yi|xi)V2(£|xi, x 2 )V 2 (y 2 \x u x 2 ). (108) 



Pr[T = t\x x = 


xi,X 2 


= x 2 ,U = u,Q = 


= i,Yi = y x ] 






= Pr[T, = t\X x 


A Xl; 


X 2 % = x 2 , Q = i. 


Ui = u, Yu 


= yi] 




= Pr[T, = t\X, 


i X^. 


X 2 i = x 2 , Q = i. 


(Xl >^i+l: 


> Ti+ 2 , ■ 


■ ■ , T n ) = u, Y x 


= Pr[T, = t\Xi 


i Xl, 


X 2 i = x 2 , (Y x , 


Ti+\, T i+2 , ■ 


■■ ,T n ) 


= u, Y u = y x ] 


= Pr[T, = t\X x 


A = Xl] 










= V 2 (t\x u x 2 ), 
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From (!HT!) - (!83|) . we may conclude that there exists a number 7 that satisfies 

< 7 < min(/(F i; U\Q),I(T; U\Q)) (109) 

such that 

-H(T n ) = H(T\U, Q) + 7, ~H{Y?) = H(Y X \U, Q) + 7. (110) 
n n 

From (1931) . we see that 

Pr[Y! = y 1 \X 1 = Xu Q = i\ = V x (y x \x x ), (111) 

which means 

1 - 

H(Y X \X X ) = H(Y X \X X ,Q) = -Y j H{Y l \X l ). (112) 

By combining ((SI}, jSZJ, (ED, (TT08T) . (TOl) . ffTTOj) and (1TT21) . and allowing n to approach 
infinity, we obtain the desired result. 

8.2 Proof of Theorem [3] 

Because the distribution in Theorem [1] satisfies the Markov chain (Q,U) — > X x — > Y x , we 
have 

H(y 1 |X 1 ) = J H-(Y 1 |X l! C/,Q). (113) 

Hence, comparing with the result of Theorem [T], in order to prove this theorem, it is sufficient 
to show that for any p(q)p(x x , u\q)p(x 2 \q), we have 

H{T\U,Q) = H(Y 2 \X 2 ,U,Q) (114) 
H(T\Q) = H(Y 2 \X 2 ,Q), (115) 

where entropies are evaluated with distribution 

p(q, u, x x , x 2 ,y x , t, y 2 ) = p(q)p{x x , u\q)p(x 2 \q)V x (y x \x x )V 2 (t\x x , x 2 )V 2 (y 2 \x x ,x 2 ). (116) 

With the distribution in (11161) . we have 

H(T\U = u,Q = q) = H{Y 2 \X 2 = x 2 ,U = u,Q = q) (117) 
H(T\Q = q) = H(Y 2 \X 2 = x 2 ,Q = q). (118) 
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Moreover, 

H(T\U, Q) = ^p{q)p(u\q)H(T\U = u,Q = q) (119) 

q,U 

H(T\Q) = J2p(q)H(T\Q = q). (120) 

In addition, 

H(Y 2 \X 2 , U,Q)=J2 p(q)p(u\q)p(x 2 \q)H(Y 2 \X 2 = x 2 ,U = u,Q = q) (121) 

q,u,X2 

= P{q)p{u\q)p{x 2 \q)H{Y 2 \X 2 = x 2 ,U = u,Q = q) (122) 

q,u,X2 

= ^p(g)p(M|g)//(F 2 |X 2 = x 2 ,U = u.Q = q) 5>(x 2 |g) (123) 



q,u X2 



Y J P{q)p{u\q)H{Y 2 \X 2 = x 2 ,U = u,Q = q) (124) 



q,U 



and 



H(Y 2 \X 2 ,Q) = ^p(q)p(x 2 \q)H(Y 2 \X 2 = x 2 ,Q = q) (125) 
= ^2p(q)p(x 2 \q)H(Y 2 \X 2 = x 2 ,Q = q) (126) 



(/■■'■ 2 



= £) P ( g )#(y 2 |x 2 = x 2 , q = q ) 5>(x 2 | g ) ( 127 ) 

q X2 

= Y,P(q)H{Y 2 \X 2 = x 2l Q = q), (128) 
i 

where we obtain ffT22j) and (TT26j) using Condition 1 with n = 1. Using ffTTTD . (jTTgj) . ffTT9|) . 
(TT20D . (HMD and (IT281 . we obtain (ITT4j) and (fTT5D . and thus have proved Theorem H 
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